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Various sequential sampling plans based on 
Wald's Sequential Probability Ratio Test 
(SPRT) {Wald 1947, Wetherill 1966) have been 
developed for sampling forest populations. Such 
plans have been used to test hypotheses for 
decisionmaking and to classify populations. At 
least 27 plans have been developed to aid in mon- 
itoring insect populations or their damage. 

Some plans were based on one SPRT to yield a 
two decision procedure such as control versus no 
control, while others were based on two SPRT's 
to yield a three decision procedure such as light 
versus medium versus heavy infestations. Be- 
cause most sequential sampling plans in forestry 
have been applied to the field of entomology, 
this paper will emphasize entomological ex- 
amples. 

The sample size needed to make a decision to 
accept or reject a hypothesis for sequential 
sampling plans based on Wald's SPRT is a 
random variable. A decision to accept or reject a 
hypothesis or to continue sampling is made after 
each observation is taken, and observations are 
taken until enough evidence has been collected 
to make one of thp terminating decisions. Such 
plans usually require only 40 to 60 percent as 
many observations as an equally reliable fixed 
sample size procedure. They are intuitively ap- 
pealing in that few observations are needed to 
make a terminating decision when, for example, 
insect populations are sparse or abundant. Given 
the budget restraints faced today, sequential 
sampling plans should find wide applicability 
where we need to classify populations or com- 
pare populations with some standard for de- 
cisionmaking purposes and observations are 



time consuming, costly, and/or destructive. 
Such plans would also be useful when it is impor- 
tant to make a quick decision. 

The first step in constructing a sequential 
sampling plan is to define the sampling unit and 
associated random variable of interest. The dis- 
tribution of the random variable must then be 
determined. All of the plans developed to sample 
forest populations are based on either the bi- 
nomial, negative binomial, normal, or Poisson 
distributions. For the normal and negative 
binomial distributions, the population variance 
o 2 and the clumping parameter K, respectively, 
are assumed known, and if unknown must be 
estimated, 

The second step is to set the class limits (e.g., 
economic thresholds or pest density levels), the 
simple null and alternative hypothesis parameter 
values of the underlying random variable, and 
the associated risk levels (probabilities of a Type 
I Error (a) and a Type II Error (ft) J. The gap 
between the two class limits (the interval be- 
tween treatment and nontreatment thresholds) 
depends on the biology and behavior of the in- 
sect and its damage (Knight 1967; Waters 1955, 
1974). The two class limits define three decision 
zones; the two terminating decisions or category 
classifications such as no control (acceptance of 
the hull hypothesis) and control (acceptance of 
the alternative hypothesis), and the no decision 
zone (continue sampling). 

Because there are two class limits or simple 
hypotheses that are used to construct a SPRT, 
two types of error can occur in decisionmaking: 



(1) accepting the null hypothesis when the alter- 
native hypothesis is true and (2) accepting the 
alternative hypothesis when the null hypothesis 
is true. The probabilities (risks) of these errors 
must be set in advance according to the serious- 
ness of each error. 

After the decision boundaries for a given 
SPRT have been developed given the underlying 
distribution and the class limits and associated 
risks, the operating characteristic (OC) and 
average sample number (ASN) properties of the 
test should be determined. Wald (1947) has 
developed OC and ASN equations to describe 
the properties of the test for all possible values 
of the random variable of interest. The OC equa- 
tion or curve shows the probability of accepting 
the null hypothesis or lower classification, and 
the ASN equation or curve shows the average 
number of observations needed to make a ter- 
minating decision. The shapes of the OC and 
ASN curves depend upon the underlying distrib- 
ution and class limits and associated risks 
(Waters 1974, Onsager 1976). 

In all of the sequential sampling plans devel- 
oped to sample forest populations, it is assumed 
that Wald's OC and ASN equations describe the 
actual OC and ASN functions of the plan, which 
means that the nominal values used to construct 
a SPRT are assumed to be the actual values. 
Wald's equations are developed on the assump- 
tion that a terminating decision is made as soon 
as a decision boundary is crossed. This assump- 
tion is not true because of the integer nature of 
the decision process of a SPRT, which results in 
overshooting of the decision boundaries when a 
terminating decision is made (fig. 1) (Wald 
1947). Thus, Wald's equations are not accurate. 

Wald's equations also assume that (1) only one 
observation is taken at each stage of the sequen- 
tial process, (2) terminating decisions are possi- 
ble after one observation has been taken, and (3) 
there is no upper limit to the number of observa- 
tions taken before a terminating decision is 
made. Many forest researchers have modified 
the decision process of the SPRT by taking more 
than one observation at each stage of the sam- 
pling plan, not making a terminating decision 
until some minimum number of observations has 
been taken, and forcing a terminating decision to 
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Figure 1. Acceptance (dj and rejection (d v ) 
decision boundaries for the lodgepole needle- 
miner sequential sampling plan. Lines con- 
necting points (.) show a sequential sample of 
11 observations yielding a decision of light in- 
festation. Notice overshooting of the lower 
decision boundary. 



be made at some maximum number of observa- 
tions. Any of these modifications will, of course, 
affect the actual OC and ASN functions of a 
sampling plan. However, in all applications 
where one or more of these modifications are 
made, Wald's OC and ASN equations are still 
used to describe the properties of the sampling 
plan. 

This paper examines (1) the errors inherent in 
Wald's OC and ASN equations due to overshoot- 
ing of the decision boundaries and (2) the effects 
of the above modifications on the accuracy of 
Wald's equations for sequential sampling plans 
based on the normal distribution using Monte 
Carlo procedures. 



WALD'S SPRT - NORMAL 
DISTRIBUTION 

In forest sampling, Wald's SPRT is used to 
test the simple null hypothesis Ho : = against 



the simple alternative hypothesis H, ; = 9i {0. 
> 0o ), where is the test parameter of the distri- 
bution of the random variable X. The binomial 
distribution is used to describe X when X takes 
on one of two values (X = 1 for tree or plant or 
part of tree or plant infested and X - for tree 
or plant or part of tree or plant not infested). 
The negative binomial, normal, or Poisson dis- 
tributions are usually used to describe X when X 
is the number of insects per sampling unit. 
and 0, define the class limits for the two decision 
categories control or no control. The proba- 
bility of a Type I Error (a) is the probability of 
rejecting H when = , and the probability of 
a Type II Error (/J) is the probability of rejecting 
H, when = 9, . At the class limits, and 0, , 
the risk levels are set according to the serious- 
ness of the two errors. 

The two above simple hypotheses are used to 
develop the following decision rule: H ; <_8 
(no control) versusHi:0> 6 (control). The class 
limits, and 6, , are critical values of 6, If 0>0, 
(the zone of rejection), protect against accepting 
Ho by setting/? at 0, . If <0 {the zone of accep- 
tance), protect against rejecting H by setting & 
at . If <e <0 l (the zone of indifference), it 
does not matter what decision is made if is 
about halfway between B and 0, but becomes 
more important what decision is made as 0-*- 
and &-* 0, , This concern is expressed by the 
values of a and p set at and 0, , respectively. 
The values , 0, , a, and J3 given the distribution 
of X describe a particular SPRT. 



Decision Boundaries 

The SPRT bases its decisons on a sequence of 
observations (x, , x 2 , . . .) from the given dis- 
tribution of the random variable X. At each 
stage of the test, an observation is taken at ran- 
dom from the given distribution f(x,0), and the 
probability ratio 



based on n observations taken up to and in- 
cluding the n^ stage is calculated. At each stage 
one of the following decisions is made: 

(1) If R n J> A, stop sampling and reject H ft . 

(2) If R fl <B, stop sampling and accept Ho. 

(3) If B< R rt < A, continue sampling. 
A2(l-p)/ and B^/3/CM. The approximate 
equalities are due to the fact that the number of 



observations is a discrete integer variable that 
causes overshooting of the decision boundaries 
before a terminating decision can be made for a 
SPRT (fig. 1). Observations are taken until one 
of the terminating decisions is made. 

E n can be simplified by taking the natural 
logarithm of each density function ratio in the 
product R fl , which yields g,, = j, Z, where 2 ( = In 
[ f(x,,0, )/f(x,,0 )]. The decision procedure is 
now 

(1) If n j> a, stop sampling and reject Ho. 

(2) If $ < b, stop sampling and accept H . 

(3) If b < S n < a, continue sampling. 
a~ln Aandb-lnB. 

is usually converted to the statistic J, x,, 
which is easier to calculate, by setting K n =a 
and Z H - b and solving for Ji x, to determine the 
upper rejection <D W = J,x, =h 2 + sn) and lower 
acceptance (D L = ,l t x, ~ h t + sn) boundaries, 
respectively. The decision boundaries are parallel 
straight lines with intercepts h, and hi and com- 
mon slopes. The decision is now: 



(1) If iliXt > h 2 + sn, stop sampling and 
reject H . 

(2) If f.ix/^hi+sn, stop sampling and 
accept Ho . 

(3) If hi +sn<,! 1 x,<h I +sn, continue sam- 
pling. 

h, , h 2 , and s are calculated from , 0t , a, and /?. 
For the normal distribution, (x-ji ) J 
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when the test parameter 0=^ , the mean of the 
distribution and the nontest parameter o 2 is the 
variance of the distribution. 



Once the density function f(x), 9 ,B lt a, and /? 
are determined, the decision boundaries Dy and 
D are easily obtained from the probability ratio 
R fl . For the normal distribution, o 2 is assumed 
known and if unknown must be estimated. Even 
though the decision boundaries are parallel 
straight lines, the probability of making a ter- 
minating decision is one. The class limits used 
for decisionmaking are in terms of the mean of 
the underlying distribution and are functions of 
the test parameter 9, For the normal distribu- 
tion, the test parameter ^ is the mean of the dis- 
tribution. The equations for h, , h 2 , and s were 
summarized by Waters (1955) for the binomial, 
negative binomial, normal, and Poisson distribu- 
tions. Talerico and Chapman (1970) developed a 
Fortran IV computer program (SEQUAN) to 
calculate h,, h a , and s and plot the decision 
boundaries for,the four distributions above. 



A sequential sampling plan based on the 
normal distribution will be used as an example 
throughout this paper. Stark (1952) developed a 
plan for classifying lodgepole needleminer 
(Recurvaria milleri Busck.) populations as light, 
medium, or heavy to make preliminary surveys 
of needleminer outbreak areas. The sampling 
unit was a branch tip including needles up to 5 
years old, and the random variable was the num- 
ber of live larvae per branch tip. It was deter- 
mined that the number of live larvae per branch 
tip followed approximately a normal distribu- 
tion, and the standard deviation a was estimated 
to be 15.62. One of the SPRT's in the three de- 
cision (2SPRT) procedure was used to test 
whether insect infestations were light or medi- 
um. The class limits \i and ^ , were set at 5 
and 15 larvae per branch tip, respectively, and a 
and /? were set at 0.05 and 0.10, respectively. 



The decision rule was to classify an infestation 
as light if n <H D =5 and to classify it as heavy 
if H > no = 5. If n - 5 for a specific needleminer 
population, the SPRT would indicate a medium 
infestation 1 out of 20 times when the infestation 
was light. If fj =15 for a specific needleminer 
population, the SPRT would indicate a light in- 
festation 1 out of 10 times when the infestation 
was medium. Using the formulas for the normal 
distribution, h, , h 2 , and s were calculated to be 
-54.93, 70.52, and 10, respectively (fig. 1). 



Wald's OC and ASN Equations 

The OC function L(0) is the probability of ac- 
cepting Ho as a function of 6. Wald's OC equa- 
tion is L(0)^(A M0 >~1)/A''<>-B'"0') where A 
and B are as defined earlier and h(&) is such that 



f<x,0, 



h(0) 



f{x,0)dx = 



f(x,0 



I f(x,0, 
x 



f<X,0o) 

where x is a continuous or discrete variable, re- 
spectively, and h(0)*0 {Wald 1947). To obtain 
points on the OC function, one of the above equa- 
tions is solved for h(0), Q is determined for vari- 
ous values of h(0), and L(0) is calculated. When 
h(0) = 1 and -1, - and 0i , respectively. For 
the case where h(0)=0, L<0) a/(a-b) (Wald 
1947). For the normal distribution, 



( >" 



i)/(A' 1 ""-B" 



where - ^ . 

The ASN function E (n) is the average num- 
ber of observations needed to make a terminat- 
ing decision. Wald's ASN equation is 

E fl (n) 2E (2 n )/E fl (Z) for h(0) 
where Z fl = J,2, as defined earlier, E (n) ~bL 
(0) + a[l-L{0)], and E 9 (Z) ~E [ ln(f(x,0, )/ 
f(x,0 ) ) ] (Wald 1947). To obtain points on the 
ASN function, E (n) is calculated for several 
values of determined from h(0) above. For the 
case where h(0) -0, E e (n) ^ab/E fl (2? ) (Wald 
1947). For the normal distribution, 




OC and ASN points for h(0) = 4(-0.5)-4 will 
usually describe the OC and ASN functions ade- 
quately. Formulas for Wald's OC and ASN equa- 
tions for the four distributions discussed earlier 
were summarized by Waters (1955). Talerico and 
Chapman (1970) and I have developed Fortran 
IV computer programs to calculate OC and ASN 
points for selected values of Jh{0). My program 
also calculates Monte Carlo OC and ASN points 
for selected values of h{0). Wald's OC and ASN 
points for various values of h(n ) and n are given 
in table 1 for the sampling plan presented earlier 
to classify populations of the lodgepole needle- 
miner, and the resulting OC and ASN equation's 
are shown in figure 2. 
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Figure 2. Wald's approximate OC (L(y } ) and 
ASN (B\*(n) ) functions for the lodgepole 
needleminer sequential sampling plan. Notice 
that L(n)~l - a -0,95 at n =n -5 and 
Lfo ) ~ p = 0.10 at n =u, =16. 



As mentioned earlier, Wald's OC and ASN 
equations, and associated nominal a and /J, are 
only approximate in that they are based on the 
assumption that a terminating decision is made 
as soon as one of the decision boundaries is 
crossed. Because the number of observations is a 
discrete integer variable, overshooting of a 
decision boundary almost always occurs before a 
terminating decision is made. Wald (1947) states 
that the errors inherent in his equations due to 
overshooting are small if o- and /? are small (less 
than 0.05) and the class limits and O t are suf- 
ficiently close together. Because a and /3 are 
usually 0.10 (0.05 at the smallest) and the class 
limits are wide for most sampling plans in for- 
estry, Wald's equations may not be good approx- 
imations to the unknown actual OC and ASN 
functions. 



Sequential sampling plans require substan- 
tially fewer observations than equally-reliable 
fixed sample size procedures. For the lodgepole 
needleminer example, where p - 5, /i i = 15, 
0=16.62, a =0.05, and 0=0.10, approximately 
21 observations are needed to yield a = 0.05 
@Ho=5 and /?=0.10 @jj , =15 for a fixed 
sample size - test. Compare this sample size 
with the ASN function for the sequential sam- 
pling plan (fig. 2). The advantage of the sequen- 
tial sampling plan is clear. However, in compar- 
ing the two procedures, it must be remembered 
that Wald's equations are only approximate. 



Table 1. Wald's Operating Characteristic (L{\j. ) ) and Aver- 
age Sample Number (Ey. (n) ) points for several values of 
h(\i. ) and \* for the lodgepole needleminer sequential sam- 
pling plan 
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Figure 4, Comparison of Monte Carlo Average 
Sample Number (ASN) functions for sequen- 
tial sampling plans with u a =5, u, -15, 
= 15.62, a=0.05, and &-Q.10, for numbers 
of observations taken at each stage of the plan 
equal to 1, 2, 5, and 10. The ASN function for 
the fixed sample size plan with equal reliability 
is also shown. 



It must be remembered that the ASN value is 
the average number of observations needed to 
make a terminating decision. The number of 
observations needed to make a terminating de- 
cision for one sample is a random variable called 
the decisive sample number (DSN), and the ASN 
is the mean of this random variable. The serious- 
ness in the errors of Wald's equations can be bet- 
ter shown by looking at the distribution of the 
DSN (table 4, fig. 5). The distributions of DSN 
are skewed to the right; the skewness increases 
as ASN increases. The number of observations 
needed to make a decision for a particular se- 
quential sample can be much larger than ASN. 



Procedure to Obtain More 
Accurate OC and ASN Functions 

Monte Carlo results for the normal distribu- 
tion showed that the errors inherent in Wald's 
OC and ASN equations can be large and are a 
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Figure 5. Empirical distributions of the De- 
cisive Sample Number {DSN) for three values 
of n for Wald's Sequential Probability Ratio 
Test (SPRT) with & = 0.05 and p =*0.10. The 
dashed line . shows the mean Average Sample 
Number (ASN) for each DSN distribution. 



function of the difference between the pest den- 
sity levels or class limits (dj = (p , -^ )/a ) and 
associated error probabilities used to construct a 
given sequential sampling plan. The practical 
consequences of these errors are (1) the actual 
error probabilities can be smaller than the nom- 
inal error probabilities used to build the sam- 
pling plan, and (2) more field observations are 
made than necessary. If sampling is destructive, 
time consuming, expensive, and/or early deci- 
sions are desirable, the consequences of these 
errors can be severe. Monte Carlo procedures 
should be used to determine if these errors are 
serious for a given sampling plan, and if they 
are, new sequential sampling plans should be 
developed to yield actual error probabilities, and 
resulting OC and ASN functions, approximately 
equal to the nominal ones desired. 



In developing a sequential sampling plan, 
Monte Carlo OC and ASN functions should be 
obtained for the decision boundaries based on 
the nominal values of a and p. If a and /3 are 
close enough to a and /3 and Wald's and the 
Monte Carlo OC and ASN functions are similar, 
the Monte Carlo functions should be used to des- 
cribe the operation of the sequential sampling 
plan. If the errors associated with Wald's equa- 
tions are serious, new nominal values of a(a') and 
p(p') should be determined to yield new Monte 
Carlo estimates a and (3 that are approximately 
equal to the desired (old nominal values) of a and 
p. Monte Carlo OC and ASN' functions can then 
be obtained for the new sequential sampling plan 
based on the new nominal error probabilities <x 
and ft'. If a'and /? are not close enough to a and /J, 
as many iterations of this procedure as are neces- 
sary to yield the desired Monte Carlo values of a 
and ft should be used. Usually one or two itera- 
tions will be sufficient. 

Considering the lodgepole needleminer prob- 
lem, if the difference between the Monte Carlo 
OC and ASN functions with = 0.036 and 
0=0.073 and Wald's OC and ASN functions 
with a = 0.05 and ft =0.10 (table 4, figs. 3 and 4) 
are not_ considered important, then the Monte 
Carlo OC and ASN functions should be used to 
describe the operation of the original sampling 
plan based on a = 0.05 and ft = 0.10. 

If the errors in Wald's equations are consid- 
ered serious, a new sampling plan should be con- 
structed. & and ft were determined to be 0.035 



and 0.072, respectively, based on 20,000 itera- 
tions each. To determine new nominal values a 
and ft' that yield new Monte Carlo values 5'and ft 
that are approximately equal to the desired old 
nominal values o and ft, a'~(a/a)a-a 2 /a and 
ft'=(ft/ft)ft*=ft 2 /ft. For our example, &'=(0.05)V 
0.035 = 0.0714 and /J'- (0.10) 2 /0.072 = 0.1389. 
Using these new nominal values, new decision 
boundaries can be developed for a new sequential 
sampling plan and Monte Carlo OC and ASN 
functions can be obtained (table 6). a = 0.051 
with SD= 0.003, and /?'= 0,097 with SD^ 0.004. 
Notice the difference between the ASN values 
for the new sampling plan and the ASN values 
for the old sampling plan (table 4). Also, com- 
pare the empirical distribution of DSN'of the 
new sampling plan (table 5, fig, 6) with the 
empirical distributions of DSN for the old sam- 
pling plan (table 4, fig. 6) for p ~ -10, 5, and 10. 



Notice the close agreement between OC and 
ASN'of the new plan and OC and ASN of the old 
plan. A comparison of the ASN, function of the 
new sampling plan (a = 0.051, ft = 0.097) (table 
5), which is closely approximated by the ASN 
function of the old sampling plan {fig. 4), with the 
ASN function of the fixed sample size 55 - test 
sampling plan (FIXED, a = 0.05, 0=0.10, n = 
(> + Zp) a /dj -21ji is more valid than the com- 
parison with the ASN ^function of the old sam- 
pling plan (ff = 0.035, =0.072). This compari- 
son is also more in favor of the sequential pro- 
cedure. 



Table 5. Monte Carlo Operating Characteristic (OCJ and 
Average Sample Number (ASN) values for several values of 
h(n ) and p for the lodgepole needleminer sequential sam- 
pling plan (u a -5, \i i -15, a~ 15,62, a = 0,0714 f and ft'= 
0,2389) 
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Figure 6. Empirical distributions of the De- 
cisive Sample Number {DSN') for three values 
of \jt for Wold's Sequential Probability Ratio 
Test (SPRT) with a' =0.0714 and $' = 0.1389. 
The dashed line^shows the mean Average Sam- 
ple Number (ASN) for each DSN distribution. 



Wald's OC and ASN equations are still used to 
describe the properties of the modified sampling 
plan. 



To investigate the effects of these modifica- 
tions on the accuracy of Wald's equations in des- 
cribing the actual OC and ASN f unctions ^of the 
modified sampling plans, Monte Carlo OC and 
ASN functions were obtained for the following 
sampling plans for the lodgepole needleminer 
example (n = 5, f( i = 15, o 15.62, a = 0.06, 
= 0.10): 

(1) Wald's SPRT with no modifications. 

(2) Wald's SPRT with 2, 5, and 10 observa- 
tions taken at each stage. 

(3) Wald's SPRT truncated at 10, 16, and 21 
observations. 

(4) Wald's SPRT with terminating decisions 
first possible at a minimum of three and 
five observations (minimum points). 

(5) Wald's SPRT with (a) a minimum point of 
5, (b) a truncation point of 10, (c) a mini- 
mum point of 5 and a truncation point of 
10, and (d) a minimum point of 5, a trunca- 
tion point of 10, and 5 observations at 
each stage. 

To obtain OC and ASN values; 5,000 and 1,000 
iterations were used for each sampling plan at 
h(^ ) = 1 and h{/4 ) At 1, respectively. 



MODIFICATIONS OF 
WALD'S SPRT 

Many forest researchers (Cole 1960; Connola 
et al 1959; Ives 1954; Ives and Prentice 1958; 
Knight 1960a, b; Tostowaryk and McLeod 1972) 
have modified the decision process of sequential 
sampling plans based on Wald's SPRT by taking 
more than one observation at each stage of the 
plan, not making a terminating decision until 
some minimum number of observations has been 
taken, and/or forcing a terminating decision to 
be made at some maximum number of observa- 
tions. These deviations from the assumptions of 
Wald's SPRT will, of course, affect the actual 
OC and ASN functions of the modified sampling 
plan. However, in all forest applications where 
one or more of these modifications are made, 



Two to 10 observations taken at each stage 
represent the range encounter in forestry sam- 
pling plans, A truncation point of 16 was ob- 
tained using Water's (1974) suggestion of using 
the maximum of Wald's ASN function as the 
maximum point, and a truncation point of 21 
was obtained from Wald's (1947) rule for trunca- 
tion, which is to use the sample size of the equal- 
ly reliable (a = 0.05, /? = 0.10) fixed sample size B 
- test as the maximum point, A truncation point 
of 10 is approximately the average truncation 
point encountered in forestry sampling plans 
Wald's ASN value at h(n ) = 1. If a terminating 
decision has not been made when the truncation 
point is reached, the average of the acceptance 
and rejection value at that stage is used as the 
decision point. The largest minimum point used 
in forestry sampling plans is 5. 
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Monte Carlo results were obtained with the 
same starting seed value for the random number 
generator for sampling plans with number of 
observations equal to 1, 2, 5, and 10; truncation 
points of 10, 16, and 21 ; and a minimum point of 
5. Monte Carlo results were also obtained for 
sampling plans with number of observations 
equal to 1 and a minimum point of 5 using a dif- 
ferent starting aeed. This was done to reduce 
Monte Carlo variability in comparing the dif- 
ferent modified sampling plans and to show 
Monte ^Carlo variability in the case of the OC 
and ASN values for Wald's SPRT (number of 
observations 1) (table 6). 

The Monte Carlo results showed that a and /3 
become smaller and the associated ASN values 
at h(n ) = 1 becomes larger as the number of 
observations taken at each stage, truncation 
point, or minimum point increase (table 6). 
These trends are distinct for the range of 
number of observations taken at each stage and 
the truncation points considered and slight for 
the range of minimum points considered. The 
Monte Carlo variability in estimating a and 
was larger than the Monte Carlo variability in 
estimating the associated ASN values at 
b(u)=l for Wald's SPRT with no modifica- 
tions. 

The ASN functions increased substantially as 
the number of observations at each stage in-, 
creased, and the ASN functions decreased sub- 
stantially as the truncation point decreased 



(figs. 4 and 7), Wald's SPRT with Wald's trunca- 
tion point (21) yielded as ASN function closer to 
the ASN function of Wald's unmodified SPRT 
than both Wald's SPRT with Water's truncation 
point (16) or the average truncation point used 
in forestry examples (10). However, the ASN 
function is still larger than the ASN 2 . function 
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Figure 7. Comparison of Monte Carlo Average 
Sample Number {ASN) functions for sequential 
sampling plans (u =5, u i -15, a -15,62, 
= 0,05, and ft =0,10) for truncation points of 
N~ 10, 16, 21, 



Table 6. Comparison of Monte Carlo values of a and /? and Average Sam- 
ple Numbers (ASN's) at h(u) = 1 for sequential sampling plans (\A =5, 
H i "IS, a=15.62 t a -0.05, and /3 =0.10) for various numbers of observa- 
tions taken at each stage of the plan, various truncation points, or various 
minimum number of observations 



hCv) i L(u) 


: 


Number 


of observations t 


Truncation points 


: Minimum number 
-; of observations 


I ! 


i I 1 ' 


i 2 : 


5 : 


10 ' 


10 


i 16 


: 21 


' 3 1 


1 5 


a 2 


0.036 


0.032 


0,031 


0.023 


0.020 


0.123 


0,077 


0.061 


0.032 


0.035 


1 






















ASH 


11.84 


11.86 


13,19 


15.54 


18.84 


7,86 


9. a? 


10.74 


11,89 


12.22 


e 3 


.073 


,071 


.066 


.049 


,031 


.202 


.140 


.111 


.071 


.069 


-1 






















ASH 


14.35 


14.27 


15.22 


17.60 


21.25 


8.62 


11.28 


U.53 


M.28 


14.58 



on a different Monte Carlo run than the other Operating Characteristic (Ot) and AN values. 
Z 5 - 1 - 06 - 1 - L(u) for h(y) - 1. 
3 6 - Qt L(v) for h(u) - -1, 
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Monte Carlo results were obtained with the 
same starting seed value for Wald's unmodified 
SPRT and for Wald's SPRT with one observa- 
tion at each stage and (a) a minimum point of 3, 
(b} a truncation point of 10, and {c) a minimum 
point of 3 and a truncation point of 10 {table 7). 
Monte Carlo results were also obtained with a 
different starting seed value for Wald's unmodi- 
fied SPRT and for Wald's SPRT with one obser- 
vation taken at each stage and (a) a minimum 
point of 5, (b) a truncation point of 10, and (c) a 
minimum point of 5 and a truncation point of 10. 
Rlonte Carlo results were also obtained for 
Wald's SPRT with 5 observations taken at each 
stage, a minimum point of 5, and a truncation 
point of 10. 

The Monte Carlo results showed that in com- 
parison to Wald's unmodified SPRT: 

(1) a minimum point decreased^ and and 
increased the associated ASN values at 
h(/j ) = + 1 with these effects increasing 
slightly as the minimum point increased 
from 3 to 5; 

(2) a truncation point increased a to /5 and de- 
creased the associated AiN values at 
h(M) = 1; 

(3) the effects of a minimum and a truncation 
point above and beyond the effects of a 
truncation point A are none to small de- 
creases in a and /? and small increases in 



the associated ASN values at h{/j ) = 1 
with these effects increasing slightly as 
the minimum point increased from 3 to 5; 
(4) the effects of more than one observation at 
each stage above and beyond the effects of 
a truncation and,, minimum point are de- 
creases in a and /? and increases in the as- 
sociated ASN values at h(n ) = 1. 

The three modifications of Wald's SPRT con- 
sidered in this paper definitely affect the accu- 
racy of Wald's equations in describing the actual 
OC and ASN equations of the modified sampling 
plans. The size of the errors in Wald's equations 
depends on how far these modifications deviate 
from the assumptions of Wald's SPRT and what 
combination of modifications are used. 



CONCLUDING REMARKS 

Assuming a normal distribution, a comparison 
of Wald's and Monte Carlo OC and ASN func- 
tions indicated that the errors inherent in Wald's 
equations can be serious. The practical conse- 
quences of these errors are: (1) the actual error 
probabilities can be smaller than the nominal 
error probabilities used to build the sampling 
plan, and (2) more observations are usually 
taken in the field than necessary. These errors 



Table 7. Comparison of Monte Carlo values of a and /J and Average Sam- 
ple Numbers (ASN's} at hfo )= 1 for sequential sampling plans fa = 5, 
Mi =-/5, = 15.62, a =0.05, and p=0,10) for two Monte Carlo runs with 
various combinations of numbers of observations taken at each stage of the 
plan, truncation points, and minimum number of observations 



Run number : 


1 








2 






No. of observations 


! 




1 








1 


. 


5 


Minimum 1 


: 1 : 


3 


: 1 


: 3 


t 1 


5 


: 1 


: 5 




Truncation 


point* 


CD 




: 





: 


TO 


. 


10 




h(u) = 1 


a 2 


.032 


.032 


.115 


.115 


.036 


.035 


.123 


.123 


.119 


t 


l3N 


11,86 


11.89 


7.84 


7.88 


11.84 


12.22 


7.86 


8.19 


9.01 


h(y) = -i 


B 3 


.071 


.071 


.194 


.194 


.073 


.069 


.202 


.200 


.196 


ASN 


14.27 


14.28 


8.64 


8.65 


14.35 


14.58 


8.62 


8.78 


9,45 



- _ ^ .^ b ^ a ,, u ,,. J ., lJ . wum puj.ui., a truncation pome or indicates no truncation poit 

and a minimum of 1 and a truncation point of indicates an unmodified sequential sampling plan, 
z a = 1 - OC - 1 - LOO for h(u) = 1. 
3 = OC = L(u) for h(u) = -1. 
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increase as the difference between the clasa 
limits (d, ~(fj,-^ )/o) and associated error 
probabilities (a and ft) used to build a sequential 
sampling plan increase. Similar results have 
been obtained for the binomial, negative binom- 
ial, and Poisson distributions. 

Also assuming a normal distribution, Monte 
Carlo results indicate that any of the above mod- 
ifications will affect the actual OC and ASN 
functions of the sequential sampling plan and 
thus decrease the accuracy of Wald's equations 
in describing these actual functions. The prac- 
tical consequences of these modifications are (1) 
the actual error probabilities decrease and the 
ASN function increases as the number of obser- 
vations taken at each stage increases, (2) the 
actual error probabilities increase and the ASN 
function decreases as the truncation point de- 
creases, and (3) the actual error probabilities 
decrease and the ASN function increases as the 
minimum point increases. Similar results would 
be obtained for the binomial, negative binomial, 
and Poisson distributions. 

Regardless of what distribution and whether 
no or any combination of the above modifi- 
cations were used in developing a sampling plan 
based on Wald's SPRT,_ I strongly suggest that 
Monte Carlo OC and ASN functions be obtained 
for the decision boundaries of the original SPRT. 
If a, /5, and the Monte Carlo functions are con- 
sidered to be not seriously divergent from a, /J, 
and Wald's equations, then the Monte Carlo 
functions should be used to describe the actual 
functions of the original sampling plan. 

If, on the other hand, the differences between 
Wald's and Monte Carlo results are substantial, 
a new sampling plan should be developed using 
the procedure presented in this paper so that the 
Monte Carlo values a 'and /? of the new plan are 
approximately equal to the nominal values a and 
/3 of the old plan. Monte Carlo OC'and ASN func- 
tions should then be obtained for the decision 
boundaries of the new plan and used to describe 
the actual functions. 

Given the budget available, the time and coat 
of taking observations, whether sampling is des- 
tructive or not, and the importance of early 
decisions, the forest researcher will have to 
decide whether to use fixed sample size or se- 
quential sampling plans. If a sequential sam- 
pling plan is used, the forest researcher then 



must decide whether to construct the new or old 
sequential sampling plan and associated Monte 
Carlo OC and ASN functions. The only fair com- 
parison is between the fixed sample size and the 
new sequential sampling plan. For either sequen- 
tial sampling plan, the cost of obtaining OC and 
ASN functions is inexpensive. 
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The term stand stocking used in this paper 
refers to the density of treea in a stand expressed 
by the formula given by Gingrich ( 1967) 1 : 

Stand Stocking Percent (SSP) = 

(-0.0507N + 0.1698 2D + 0.0317 H) 3 )/10 

where 

N = number of trees per acre 
H) = sum of then- individual diameters 
5D 2 = sum of the squares of their diameters. 
Each tree in the stand must be tallied and mea- 
sured to derive stocking percent using the above 
equation. This process is time consuming, so 
sampling methods are used to estimate the quan- 
tities needed for the stocking equation. 

One sampling method, which I call "variable 
plus fixed area sampling" (F), provides an esti- 
mate of basal area from the variable area plot 
and an estimate of the number of trees from the 
fixed area plot with origin in common with the 
point. 

Typically, the fixed area plot is circular with 
1/20 acre area. Thus 

BA = basal area/acre = BAF x k = basal area 

factor x point sample tree count 
n = no. of trees/acre = fixed plot tree count/ 
fixed area plot size 

Db /= diameter of tree of average basal area 
= ( (BAFxk)/0,005464n)'/ 2 



Now substituting 



* Gingrich, S. F, 1967, Measuring and evalu- 
ating stocking and stand density in upland hard- 
wood forests in the Central States, For, Sci, 
13:38-53. 



H) 1 ==(BAFxk)/0.005454 
into Gingrich's equation we have 



= <-0.0507n + 0,1698nD 6/ + (0.0317BAF 
k)/0.006454)/10 



or 



SSP = -0.00607n + 0.2299 n ( (BAF x k)/ 
n) 1 / 1 +0.58122 BAF xk. 



The second method uses only the trees sam- 
pled from a variable area plot. In this case sam- 
ple tree diameters must be measured. 

Thus 



n = ,!.BAF/0.005454d, 2 i = l,2, . 
sample trees 

HBAF/0.006464) ( ?,l/oV 
ID - ,! (BAF/0.005464d, 2 ) d, 

HBAF/0.006464) Ji l/d f 
2D 1 = ^BAF/0.005454d ( 1 ) d, 1 
= (BAFxk)/O.OOB454 



. ,k; k 
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and substituting 
SSP = (-0.0507 



BAF 



BAF 



0.005454 
1 



+ 



0.005454 



d, 



BAF*k 
+ 0.0317 )/10 



or 



0.005454 
BAF 



cjqp 

OOr 



i.8122k + 31. 



I 
( = 1 



10 



d, 



-9.2959,1, 



d, 1 



Note that when BAF = 10 then 

* 
SSP - 5.8122k + 31.1331,!, 



-9.2959,t, 

d, d/ 



Frequently, in addition to stocking we need to 
know the diameter of the tree of average basal 
area. For the variable plus fixed plot method this 

is 

k * BAF v, 
D,, = ( ) 

05454n 



and for the variable plot method is 

k 

D== ; 1 
,^ , - 



The stocking and average diameter equations 
differ in each method by the way in which the 
number of trees per acre are obtained. These dif- 
ferences affect the estimate obtained by each 
method. From these equations we can see that 
the variability of point sample estimates in- 
crease as the variability of tree diameters in- 
crease among points whereas among plots the 



variability of estimates are related to the varia- 
bility in the number of trees sampled among 
plots. 

Thua the variability of stocking and average 
diameter estimates is sensitive to the distribu- 
tion of tree diameters within a stand. To see how 
each method compares in its ability to estimate 
stocking and average diameter in stands with 
different diameter distributions, I simulated the 
sampling process on a computer using four 
stands that had different diameter distributions 
(fig. 1). Trees were located randomly within each 
stand. Then for each stand 10 points were sel- 
ected and 1/20 acre circular plots were located 
with their center at each point. For each sam- 
pling method stand stocking percent and mean 
stand diameter were averaged over the 10 
points. This procedure was repeated 100 times 
while keeping tree location constant. In addition, 
stocking and average diameter were calculated 
using all trees in the stand and these were desig- 
nated as true values. 
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Figure 1. Diameter structures of two aged, 
pole, uneven-aged, and even-aged stands. 
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Each method estimated stocking percent with 
about the same accuracy except the uneven- 
aged stand was estimated more closely by the 
variable plus fixed plot method (fig. 2), The 
variable plot method deviated farthest (under- 
estimated) true stocking percent and had a vari- 
ance l l /a times greater than that of the variable 
plus fixed plot method for the uneven-aged 
stand. 

The comparison of average diameter revealed 
that the methods were similar in their estimates 
for the two-aged and pole-sized stands (fig. 3). 
But, their estimates were not similar for the 
even- and uneven-aged stands. In uneven-aged 
stands estimates obtained from the point sample 
are biased upward and the variance is QVz times 



larger than for the point plus fixed plot method. 
In the even-aged stand the pattern in variance is 
reversed, but estimates are not biased. 

Therefore, point sampling methods alone 
should not be used to estimate stocking and 
average diameter in uneven-aged stands. But the 
point sampling method can be used to provide 
estimates of stocking and average stand diam- 
eter in stands having distributions similar to the 
other three studied with a reliability equal to or 
better than the combination method. In particu- 
lar, for stands having diameter distributions like 
the even-aged stand presented here, estimates of 
mean stand diameter obtained by point sampling 
tend to be more reliable than those obtained by 
using the combination method. 
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Figure 2. A comparison of stocking precent 
estimates based on 100 averages of 10 points 
in four stands using variable area plot (V) and 
variable area plot plus fixed area plot (F) sam- 
pling methods. Lines through bars represent 
estimates of true value of stocking percent 
(line at 0) and vertical scale shows deviation 
values, 



Figure 3. A comparison of mean stand diam- 
eter (tree of average basal area) estimates 
based on 100 averages of 10 points in four 
stands using variable area plot (V) and varia- 
ble area plot plus fixed area plot (F) sampling 
methods. Lines through bars represent esti- 
mates of true value of mean stand diameter 
(line at 0) and vertical scale shows deviation 
values. 



16 



Chung-Muh Chen, Forest Biometrician, 

Minnesota Department of Natural Resources, 

St. Paul, Minnesota 

and Dietmar W. Rose, Associate Professor, 

University of Minnesota, 

St. Paul, Minnesota 



Previous models of individual tree growth 
have been based on the open-grown tree concept 
(Newnham 1964, Arney 1974, Ek and Monserud 
1974) and on empirical functions (Bella 1971, 
Tennent 1976, Moore et al 1973, Adlard 1974). 
In the open-grown tree approach, individual tree 

size is assumed to be maximum if the tree is 
grown in the open (free from competition); the 
maximum size is reduced if the tree is grown in a 
stand (subject to competition). 

Most periodic growth models for individual 
trees express the amount of competition a tree 
receives from its neighbors. Many view compe- 
tition between trees in terms of zones of in- 
fluence (Staebler 1951, Newnham 1964, 1966,. 
Opie 1968, Bella 1971, Gerrard 1969, Arney 
1974, Keister 1971, Ek and Monserud 1974). The 
basic assumption is that competition between 
individual trees occurs only when their zones 
overlap. 

The objectives of this study were to formulate 
nonlinear biological basal area growth models 
and to analyze the relation between individual 
tree competition and growth. We used data from 
an even-aged red pine plantation near Star Lake, 
Wisconsin. 1 Initial spacing of the plantation was 
6 by 6 feet. Site index was 65 at age 50 and tree 
survival was high on the site (Wilson 1963). 



1 The data were provided by Dr, Alan R. Ek, 
University of Minnesota, College of Forestry, 
and Wisconsin Department of Natural Re- 



sources. 



FORMULATION OF 

NONLINEAR BASAL AREA 

GROWTH MODEL 

Periodic basal area growth can be related to 
potential basal area increment when the im- 
mature tree is free from competition. This 
growth can then be reduced by a competition fac- 
tor when the tree is grown under competition, 
This relation can be shown by the following 
equations: 

AB = AB*[l-e- M """]" (1) 
orAB=AB*e-* c (2) 

where AB = the periodic tree basal area growth 

of the tree, 

A B* = the potential growth when the tree is 
free from competition, 

C = the competition index of the tree, 

k, m, and N = growth factors such as site, 
species, age, and density. 

AsCt.ABl ;asCI ; ABt 

HmAB=AB*fork>0,m> 



lim B = for intolerant species. 



The advantage of the models (equations 1 and 
2) is that they offer a logical explanation of the 
relation between the dependent and independent 
variables. However, the potential growth of a 
tree might not be realized not only because of 
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competition, but also because of disease, insects, 
animals, wind, frost, etc. Furthermore, the 
model is less useful for predicting growth of 
intermediate and suppressed trees when they are 
completely released because these trees may not 
regain their full potential growth. So the con- 
dition that 

AB-"AB*asC-*0 
may not be fully realized. 

An alternative approach is to assume that tree 
periodic basal area growth is directly related to 
site, initial basal area, and functional crown sur- 
face, and inversely related to competition index. 
Functional crown surface is correlated with tree 
growth. Tree height or basal area are correlated 
with the functional crown surface for immature 
individual trees growing in dense even-aged 
stands (not stagnant) of single intolerant 
species. Therefore, in the case of lacking informa- 
tion on tree height and functional crown surface, 
initial tree basal area or diameter breast height 
and competition index may predict tree periodic 
basal area increment. For this study, we define 
two models similar to equations 1 and 2, except 
AB* was replaced by "a B 6 "; where B is the 
basal area of the tree at the beginning of the 
growth period and a and b are two additional 
factors, 

MEASURE OF 
TREE COMPETITION 

The competition imposed on a tree is measured 
by the distance to the competing tree, amount of 
overlap area, and relative size. It is formulated 
in the following manner: 



(3) 



where C, = the competition index of tree i, 

A, = the area of the influence zone or circle 
about the sample tree i with the 
radius (P,) proportional to the tree 
diameter, 

P, 1, 1,6, 2.0, 2.6, 

O tf = the area of overlap between tree i 
and its competitor j for j = 1, 2 ..... 

N,, 



N, = number of competitors of tree i, 
W,j - a weighting factor for O,j. 
We used the following weighting factor: 



where 



r w = (dj/d,) E and dj/d, is the ratio of dia- 
meters of tree i and its competitor j, 

E = max (d,/d,) for all trees (i, j ) with 
overlapping influence zones, 

ki and E are factors related to species, 
age, site, and density (E = 1 to 4). 



k, 



(5) 



when weighting factor Wy = 1 is assigned for the 
condition d, = d,, n was assigned the value 1 or 2. 

As ry-0,W,/*0; 



and as 

The weighting factor for modifying the overlap 
area between a tree and its competitor should re- 
flect the potential size difference of the sample 
tree i and its competitor j. 

RESULTS AND DISCUSSION 

Approximately optimal values of Pi and E (for 
equations 3 and 4) for standard error and correl- 
ation of regression between basal area growth 
and competition index were derived via iteration 
(table 1). Bella's (1971) index generated com- 
parable results. When no iterations on E and n 
were performed, i.e., Enl Chen's (1976) 
model resulted in correlations that were approxi- 
mately 10 percent higher. A modification of 
Gerrard's (1969) index (inclusion of the exponent 
E) proved somewhat inferior to the two other 
indexes, but is easier to calculate. The indexes 
will have to be compared using different species, 
ages, and stand conditions. 

For predicting individual tree growth one of 
the two proposed basal area growth models was 
fitted via nonlinear regression (table 2). The 
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Table 1, Correlation between periodic basal area growth 
and competition index for three competition models 1 



Plot 2 


Observations 


: Coefficient of 


Determination 3 : Standard : Standard Recession 


Coefficients 


Total d : 


Regression 11 : Ad-lusted i 


Unadjusted 


: error 


; a ; 


b 


: k 


1 


186 


17 


0.55 


0.85 


Inch* 
2.84 


34.63 


-0.4571 


1.5557 


2 


203 


20 


.52 


.86 


2.16 


3.25 


.1137 


0.7839 


3 


269 


19 


.88 


.95 


0.52 


0.043 


1.2744 


.2038 


1, 2, & 3 
combined 


658 


56 


.60 


.86 


1.81 


1.38 


.3682 


.7912 


4 


296 


24 


.56 


.80 


5.17 


3134.23 


-1.4585 


1.9977 



aB D (exp (-kC)) 
AB " the periodic tree basal area growth in square inches. 
C D the competition index (equation 3) with weighting factor (equation 4), 
Parameter E=2,R=l,n i =2 used for all in calculation of competition index. 

2 Stand age 32-37. 

3 Total observations used for calculating competition Index. 

''Number of trees for which growth and competition was determined. 

5 The coefficient of determination adjusted for the mean measures the contribution of the sample mean in 
explaining variance of the dependent variable. 



Table 2. Nonlinear regression gits for a nonlinear basal area growth 

models 1 



> \ i Standard 
ploc , r . error 


E 3 ) Pi 11 \ n | Remarks 


mot 


2 


IfKJ/J 2 




i 


-0.82 


1,26 


212 Clian (1976) model 


2 


-0.72 


1.41 


2 2.5 1 y, - (1-e -ki( d j/ d P ) n 


3 


-0.73 


1.60 


2 1.0 2 (equation 4) 


4 


-0.70 


2.42 


1.5 1.0 2 


1 


-0.82 


1.27 


3.5 1.0 - Bella (1971) 


2 


-0.72 


1.42 


2.0 2.5 


3 


-0.73 


1.60 


4.0 1.0 - Wjj " (dj/dijE 


4 


-0.70 


2.43 


1.5 1.0 


1 


-0.72 


1.50 


4 2.5 - modified Gerrard index 


2 


-0.63 


1.57 


4 2.0 


3 


-0.72 


1.62 


4 1.0 - C t - I i^- } 


4 


-0.63 


2.63 


2 1.0 - w ij * 1 


'aiL ae~ kc i or log 
(competition index) and W^ 
z Plot age 32-37 [plots 


Hi OJ4 

fiBj " log a - kC^ where G^ !.* _JJ. y 
j-l A i J 
is a weighting factor. 

1 to 3 are control plots and plot 4 ia a 



thinned plot. 

3 E - parameter of competition index 

*P1 - proportionality factor for radius of zone of influence of 
subject tree and tree diameter. 
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model adequately explained basal area growth as 
a function of basal area and competition for the 
first three plots. The large standard error for the 
thinned plot (plot 4} indicates a much poorer fit. 

Further work is underway to test those models 
using more plots and different species and ages. 
Preliminary results indicate less satisfactory fits 
for an older plot of the Star Lake plantation (age 
58-65). Relative diameters or basal areas are ade- 
quate weighting factors only in some instances. 
It will be necessary to find weighting factors 
that better reflect the potential size increment of 
the sample tree i and its competitor j such as 
height, crown ratio, live crown length, or func- 
tional crown surface or volume. These variables 
tnay improve the growth models for red pine as 
well as for other species in even-aged stands. 
Species tolerance will have to be considered if 
the competition model is applied to mixed stands 
(Chen 1976). 
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By observing the evolution of natural phenom- 
ena, it is readily apparent that events occur that 
are not entirely predictable. The modeling of 
such events is often facilitated by employing 
random processes. The intent of this paper is to 
consider and to illustrate a subclass of the set of 
stochastic processes one known as discrete 
time Markov models. This subclass is termed 
discrete time because time is indexed in finite 
steps rather than as a continuum, Diameter 
class distributions in forest stands will be used 
to illustrate the process. 

A Markov chain is a discrete time stochastic 
process consisting of a sequence of random 
events { X, , X 3 ,. X 3 , . . .} , each with a finite 
number of possible outcomes { ai, BI, . . ., a r } 
{Phillips et al 1976) This set of possible out- 
comes is termed state space. The value assumed 
by X, is called the state of the process at time i. 
It is assumed that the random variable, X ( , 
depends only upon the previous event X,_! , and 
affects only the subsequent one, X, t i . This as- 
sumption, known as the Markov assumption, 
avoids having to express joint distributions for 
all events at one time. Instead, it is sufficient to 
express conditional distributions of just two 
neighboring random variables at a time. Mar- 
kov's assumption simplifies the problem but 
does not completely eliminate the dependence 
between random variables. Therefore life pro- 
cesses can be realistically represented. 

Given that X = a ( is the present state of a 
process, one desires to know the probability dis- 
tribution of the possible outcomes for Xi .These 
transitional probabilities are designated p,,, 
which represent the conditional probability of 
going from state a, to state a^ after one step or 
transition, i.e., P{ Xi -&j\ X = a,}. 



A conventional means of exhibiting transition 
probabilities is with a square matrix. A transi- 
tion matrix for a three-state process may be 
represented as: 



P - 



pa, i 

p3,l 



Because each row of the above matrix repre- 
sents a probability distribution forXi, Zp y -l 
for each i. 

The matrix P completely describes the process 
for any given outcome at X, given its initial 
dependence upon X . If event Xi is described, 
the probability distribution for X a must be de- 
veloped. By the Markov assumption, the next 
state, X 2 in this instance, depends only upon the 
previous state, X ( . However, X, depends upon 
X . Three mutually exclusive paths exist to pro- 
gress from Xo - ai to X a = a! (fig. 1). The prob- 
ability of being in state ai at time 2 given that 
the process was in state ai at time o may be 
represented as the sum of the three "path proba- 
bilities"; 



P{ Xo=a 1 ->X a -a,} = 

P{ X =a,-X 1 = a,-X 
+ P{ X =a,->X 1 =a 1 -X 
+ P{ X -a.-^X, =a,-*X 3 = 



(1) 



By employing the Markov assumption and the 
matrix of -one-step transition probabilities, 
Equation (1) can be rewritten as: 



P{ 



PI,, P{ X, -a.-Xi -a,} 
p,, a P{ X, =a 2 -X 1 -aj 

Pi,a P{ Xi 83-^X2 =8[}. 
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Figure 1. Possible transitions from X 
toXi =fli for a three-state process. 



The probability for the first leg of each path is 
readily obtained from the matrix of one-step 
probabilities. The probabilities for the second 
step of each path is really the transition proba- 
bility associated with the second transition. At 
this point an additional simplifying assumption 
is needed. If it is assumed that the transitional 
probabilities do not change with time, then: 



P{ X - a, -*Xi = a, } = PL, PI,, + p tll pi,, 



(2) 



The assumption is known as stationarity; with- 
out it the process would not be able to continue 
because a new transition matrix would be needed 
for each transition. Using similar logic, probabil- 
ities may be developed for other states at Xi 
given that X = ai : 



P{ 



pi.ipi.i + Pi.apj.1 + 



=pi,ipi,3 + pi.apa.3 



(3) 



Thus, Equations (2), (3), and (4) specify the 
probability _ distribution of X 2 , given that 
X = a, . Similarly, other expressions may be 
developed to specify the probability distribution 
for X, given that X = a, or X = a 3 . The nota- 
tion p,/ 1 1 is utilized for the individual elements 
of the two-step transition matrix and the com- 
plete two-step matrix, P<, for the three-state 
process may be represented as: 



Pi 2 



[2 ) 
Pi 3 

(2 ) 
P2,3 



Pa.i 



p3,l 



As indicated above, the individual elements 
are calculated from the one-step matrix: 

Pu'-Pl.lPl.! +Pl,lPV + P1.3P3.I. 

The two-step transition matrix is obtained by 
squaring the one-step matrix: 

p(i)=p.p = P* (5) 

The relation expressed in Equation (5) is only 
possible by use of the Markov and stationarity 
assumptions. 

To obtain the distributions of X 3 given var- 
ious possible states of X , it is again necessary 
to consider possible paths that lead from X to 
X 3 in a similar fashion as was illustrated for X 2 . 
Following that logic, the elements of the three- 
step transition matrix can be written in terms of 
the two-step and one-step probabilities: 



PI, i -Pi.ipi.i 



+Pi,iPi,i + 



In matrix notation that is: 

The n-step transition matrix is equal to the one- 
step transition matrix raised to the n" 1 power, 
P""P". (6) 

The relation expressed by Equation (6) is the 
most important result for Markov chains be- 
cause it gives the probability of being in a parti- 
cular state at a particular time, given the initial 
state. 

To illustrate the use of the discrete time Mar- 
kov model, data were obtained from permanent 
growth plots in a central Wisconsin mixed hard- 
wood stand that had been measured for diameter 
for 19 consecutive years. The 26 states chosen 
for the model are the 1-inch diameter classes 
from 8-inch through 29-inch, a diameter class 
called > 29 inch, and the categories of death and 
harvest. The 8-inch class was the lowest diam- 
eter class in which all trees were measured and 
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31-inch was the largest diameter of any tree after 
19 years. Because only the first nine growth 
periods were utilized to develop the model, it 
was necessary to define the largest diameter 
class as > 29 to obtain numerical values for the 
transition probabilities. 

A tree in any state representing a diameter 
class may either remain in that class, move to a 
higher class, die, or be harvested. The dead and 
harvest states are considered absorbing states 
because once a tree has entered either one of 
these states it cannot leave it. The 1-inch diam- 
eter classes are termed transient states. 

The information to determine the probabilities 
for transitions between states and to verify the 
predictive ability of the model was obtained by 
summarizing the progression of the initial trees 
in each diameter class for the 19-year period 
(table 1). Ingrowth trees were not included in the 
summaries. 

If a tree was in the 10-inch diameter class 9 
years ago, the transition probabilities associated 
with its movement are determined as follows: (1) 
the tree will remain in the 10-inch class, pi,o,i,it, 
- 44/139; (2) the tree will advance to the 11-inch 
class, pi,o, 1,1 , = 81/139; (3) the tree will move 
into the 12-inch class, pi, , 1,2, = 5/139; (4) the 
tree will die, p ll0 , m = 8/139; or (5) the tree will 
be harvested, pi, IC = 1/139. 

The transition probabilities for the other diam- 
eter classes are similarly determined with the 
respective diameter class progression sum- 
maries. Because death and harvest are absorbing 
states, p m , m =p c , e =1 (table 2). 



As indicated by Equation (5), the two-step 
transition probabilities are obtained by squaring 
the initial transition matrix. The numbers p,/ a > 
for i and j = 8 ... 29, > 29, M 1 , and C given the 
probability of a tree being in state j given that it 
was in state i two steps ago (table 3). 



(0 I 



Ift< 01 =(t 8 ' 01 ,t 9 ' l W',t 

t ffl (0 ', t c l >) is a vector whose elements corres- 
pond to the initial number of trees in each state, 
then the matrix of two-step transition probabili- 
ties can be used to predict the disposition of 
those initial trees after 18 growing seasons. The 
numbers p, m " ' and p,, (1 ' for i - 8 ... 29, > 29 
are, respectively, the death and harvest proba- 
bilities for a tree given that it was initially in the 
diameter class. The predicted number of mor- 



tality trees, m,, and harvested trees, c,, are: 



p,, 



(7) 



and 



= t, (0 > -p,.' 11 for i = 8... 29, > 29. (8) 



Because Equations (7) and (8) account for 
death and harvest, the number of surviving 
trees, s,, from each initial diameter class is 

s, = ti<" .(l-P**"-'-?!.' 11 ) 0) 

for i = 8... 29, > 29. 

Equation (9) indicates the number of trees in the 
i" 1 diameter class expected to survive. It does 
not indicate the distribution of the surviving 
trees. 

To determine the diameter distribution for sur- 
viving trees it is necessary to sum all the waya 
that a tree can enter a diameter class, regardless 
of its initial class: 

t/=*V 01 .p,/ 2 ' (10) 



1 M = dead trees; C = harvested trees. 



Table 1. The progression of all trees initially in the 10-inch diameter class 

by number of trees and change in diameter class 

(In number of trees) 





Measurement 
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2 


: 3 


: 4 


: 5 


i 6 


i 7 


: S 


; 9 


: W 


! 1 1 


! 12 
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14 


! 15 


: 16 


t 17 


; ItJ 


i 19 


10 




133 


115 
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98 


83 


69 


62 


61 


44 


41 


35 


29 


25 


22 


19 


16 


15 


13 


11 


n 


5 


23 


31 


37 


50 


61 
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68 


81 


75 


75 


69 


&7 


06 


67 
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where t/ m is the number of trees in state j after 
two transitions, regardless of the initial state. 

If the values obtained from Equation (10) are 
defined in vector form as 

t^'Hta' 1 '^' 2 ',..., W'.t^g' 11 ,^' 2 ', 
t c (1 ' ), then the equation 

t<M = t |0 > p< 21 
may be used to obtain the individual elements. 

The elements t n , (2} and t B ' 2 ' of t (1 ' are equiv- 
alent to the m, and c, from Equations (7) and 
(8), respectively. The total number of surviving 
trees equals 

?s, = H, (2) for i = 8...29,>29. 
Thus, this value may be determined as the sum 
of the elements from Equation (9) or from the 
sum of all elements of the vector t (2) except 
death and harvest. 

The foregoing relations may be utilized to pre- 
dict diameter distributions. We demonstrated 
the predictive ability of the model by using data 
collected from a stand for 19 consecutive years 
(table 4). The observed and predicted values 
agree closely; however, the number of surviving 
trees by initial diameter class is more accurately 
predicted than the distribution of surviving trees 
(Bruner and Moser 1973). 



In another study (Cassell and Moser 1974) the 
trees were grouped into six tolerance classes and 
each class was individually modeled to provide 
information on species composition and diameter 
distribution (tables 6 and 6). The results from 
this study are similar to those of the composite 
model. In general the predicted number of sur- 
viving trees by species group was accurate and 
the prediction of death, harvest, and future 
diameter distribution somewhat less accurate. 

For both of the above studies, the basic data 
were utilized to develop additional models with 
transition periods of 4, 5, 6, 7, and 8 years. 
Predictions obtained with these models were 
similar to those of the 9-year model. Further, it 
was indicated that the accuracy of prediction 
was not dependent upon the length of the predic- 
tion period but that predictions beyond one per- 
iod were less accurate. 

To utilize the Markov process, the Markov and 
stationarity properties must be satisfied. First, 
to predict the next state the present state must 
be known. Second, the transition probabilities 
between two specific states must remain con- 
stant. In regard to the dynamics of diameter 
distributions, these properties imply that; (1) 
the diameter distribution some time in the future 



Table 4. Observed and predicted stand values at measurement 19 using 

composite model 
(In number of trees) 



d.b.h. class 


Initial 


: Survivors 


rDlameter Distribution: 


Mortality 


Harvested 


(inches) 


no. trees 


: Actual : 


Predicted 


i Actual i 


Predicted : 


Actual ; Predicted 


Actual : 


Predicted 


8 


209 


175 


165 


22 


28 


29 


37 


5 


6 


9 


148 


117 


117 


97 


98 


23 


26 


8 


5 


10 


139 


122 


122 


116 


109 


15 


14 


2 


3 


11 


131 


110 


113 


116 


108 


14 


12 


7 


6 


12 


111 


91 


90 


94 


91 


10 


14 


10 


8 


13 


107 


85 


86 


72 


92 


14 


17 


8 


4 


14 


88 


76 


75 


85 


79 


7 


8 


5 


5 


15 


86 


73 


70 


74 


75 


4 


8 


9 


7 


16 


74 


54 


55 


75 


73 


8 


11 


12 


8 


17 


64 


50 


48 


67 


66 


8 


9 


6 


7 


18 


52 


35 


35 


67 


56 


6 


7 


11 


10 


19 


32 


19 


20 


51 


45 


3 


3 


10 


9 


20 


26 


19 


19 


24 


35 


1 





6 


7 


21 


14 


11 


11 


36 


26 


1 





2 


3 


22 


8 


5 


5 


16 


24 








3 


3 


23 


9 


'5 


7 


18 


6 








4 


2 


24 


6 


3 


4 


7 


11 


1 





2 


2 


25 


3 


3 


2 


8 


7 





1 





1 


26 


6 


3 


2 


2 
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1 


2 


2 


2 


27 


5 


2 
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4 


2 


3 


1 


1 


28 


4 


1 


2 


2 





2 


3 


1 





29 


4 


3 


1 





4 


1 


3 








> 29 


1 


o. 





6 


3 


1 


1 








Totals 


1,327 


1,062 


1,051 


1,062 


1,049 


151 


179 


114 


98 



26 



Table 5. Observed and predicted survivor distributions at measurement 

19 using tolerance class model 
(In number of trees) 



clnSB 


: Corapoalte 


i 




Final dlacc 


_bu tlon a 










{inches) 

e 

9 
10 
11 

12 
13 


i Initial 

209- 
US 
139 
131 
111 
107 


-. Final ! 

22 
97 

116 
116 
94 
72 


Predicted 

29 
9-9 
110 
111 
90 
92 


: Actual 

1 
5 
3 

12 
7 
4 


Fredictc 

1 
7 
6 
9 


_J.iActualiPr<iJUt_ed_ 

1 
13 12 
17 18 
21 27 
24 22 
15 17 


.Agtiiitlf Predicted 

1ft 23 
75 76 
85 83 
76 66 
51 50 
44 63 


:ActuaL:Frdictedi Actual iPrfidictai 

1100 
2200 
7100 
3400 
6 10 
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2 
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4 
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13 
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a 
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BO 
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46 
36 
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5 
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5 
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5 
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9 
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13 
7 
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47 
52 

41 
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27 
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3 
4 
6 
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9 
4 
1 
3 
1 
2 
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5 
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6 

7 



6 
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1 
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2 
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6 
7 
2 
6 
2 
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9 
5 
6 
4 
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1 
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Totals 


1,327 


1,062 


1,062 


64 


64 


167 


176 


737 


73fi 


57 


46 


1 
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Table 6. Observed and predicted mortality 
and harvest at measurement 19 using tolerance 

class model 
(In number of trees) 



Measurement 19 


Class : 




Mortality 


: Harvested 


: Actual : Predicted 


: Actual 


: Predicted 


1 





1 








2 


16 


7 


13 


9 


3 


31 


44 


50 


38 


4 


9 


19 


2 


4 


5 





1 








6 


1 





1 


1 


Totals 


57 


72 


66 


52 



depends only upon the distribution now and not 
upon past distributions; and (2) the probability 
of a tree moving, for instance, from the 8-inch to 
the 9-inch class in any specific period must re- 
main the same regardless of stand conditions, 

The larger discrepancies for predictions be- 
yond one period may be attributable to not satis- 
fying the stationarity assumption. To examine 
this possibility, the 19 years of remeasurement 
data were used to determine transition probabili- 
ties between the various states over time. These 
probabilities were fairly constant for diameter 
classes with a large initial number of trees, but 
this was not true for diameter classes with a 



small initial number of trees. This suggests that 
the accuracy of predictions for several periods is 
dependent upon good estimates of the transition 
probabilities which, in turn, are dependent upon 
sufficient data for all diameter classes. Also, be- 
cause uneven-aged stands are characterized by a 
large number of trees in the lower diameter 
classes, permanent plot data from such stands 
will have a similar diameter distribution. This 
situation will inherently lead to greater accuracy 
in prediction in the lower classes. 

Prediction of diameter distributions with the 
Markov model has both positive and negative 
points. One disadvantage is that mortality and 
harvested trees are predicted as numbers of trees 
by original diameter class so that the actual 
diameter class of a tree when it dies or is har- 
vested is not known. Another disadvantage is 
the difficulty of introducing ingrowth into the 
process. The only way to allow for ingrowth is to 
inventory trees in diameter classes below the 
lower limit for which predictions are important. 
For example, if predictions with ingrowth are 
desired for sawtimber trees in the 12-inch diam- 
eter classes and above, trees in the 8-, 9- ( 10-, and 
11-inch diameter classes at the initial measure- 
ment can be considered as possible ingrowth into 
the 12-inch and larger classes during the predic- 
tion periods. To include ingrowth in all diameter 
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classes 8 inches and above, trees in roughly the 
4-, 5-, 6-, and 7-inch diameter classes would have 
be inventoried. A third disadvantage is the lack 
of flexibility in the length of prediction periods. 
If the two inventory measurements used for the 
prediction are, for example, 5 years apart, pre- 
dictions of stand conditions can only be made for 
subsequent multiples of 5 years. Lastly, at least 
two measurements from permanent plots are re- 
quired as data for predictions. This prediction 
method is not applicable for areas not under a 
continuous forest inventory system. 

Ease of application is a major benefit of the 
Markov model. Accurate predictions of numbers 
of survivor, dead, and harvested trees, and the 
distribution of surviving trees depend only upon 
conventional continuous forest inventory data 
and a knowledge of elementary matrix opera- 
tions. 
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Most forest inventories begin with the classi- 
fication of points selected from individual aerial 
photographs. The information developed from 
this procedure is subject to several sources of 
error. Photo cover type classification of a sample 
point can be in error due to the quality and age 
of the photos or to the skill of the photo inter- 
preter. These errors can be adjusted by field 
checking a portion of the photo points. A source 
of error that cannot be corrected by field check- 
ing occurs when end lap (overlap), sidelap, crab, 
and tilt distort the photo coverage of the land 
area. This results in some areas being sampled 
more intensively than others, 

The normal forest aerial photography con- 
tracts specifies 60 percent endlap and 30 percent 
sidelap in line of flight. However, the contractor 
usually delivers more coverage of the area to be 
sure that these requirements are met. This type 
of bias is recognized by aerial photo interpreters, 
but usually ignored in the hope that the errors 
will be compensating. Contract specifications for 
aerial photos also require a specific scale the 
deviation from specified flight altitudes shall not 
exceed 300 feet for the purchaser to accept the 
photos. 

We tested photo intensity during the forest 
survey in Iowa where photographing conditions 
were good - differences in elevation were mod- 
erate and each township had many roads lying 
north-south and east-west In spite of these ideal 
conditions, photo overlap was 0.66 percent great- 
er than specified, which represented 237,000 
acres. This means that in sampling individual 
photos, without regard to overlap variation, area 



of the predominant land class would have been 
overestimated. A similar test done by the Pacific 
Northwest Forest and Range Experiment Sta- 
tion in Oregon and Washington had similar 
results (Pope etal. 1972). 



PROCEDURES USED AT 

OTHER STATIONS 

The PNW selects plot locations on maps, then 
transfers these plot locations to aerial photos 
using a radial line plotter or stereopotter. This 
procedure is considered too expensive by many 
inventory foresters. 

The Southern Forest Experiment Station 
locates sample plots on the most recent conven- 
tional aerial photographs and establishes them 
on the ground. Then assigns geographic coor- 
dinates to the nearest mile for each plot using a 
coordinatograph with a 7V4-minute quadrangle. 
The cost of this method is considered rather high 
also. 

The Southern Region establishes forest sur- 
vey field plots on maps by a systematic grid. 
Then transfers these locations to aerial photos 
and establishes a cluster of photo plots on each 
photo containing a field plot. 

The Northeastern Region locates samples 
directly on individual conventional photos. Then 
arranges samples systematically using three 
photo plots per photo print and randomly 
chooses ground plots by photo class. 
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A new way was needed to use these photos 
that would estimate forest area accurately. For- 
est Survey at the North Central Station ap- 
proached the problem with the objectives to: (1) 
select plot locations on aerial photos that would 
avoid bias due to uneven photo intensity, (2) 
determine how to select the proper grid scale to 
represent the land area sampled, and (3) keep 
costs down. The method we developed is des- 
cribed below. 



The township mosaic assembly corresponds 
closely to township area on the ground. To avoid 
too many of the plots falling on roads (north- 
south and east-west), the grid is turned 5 de- 
grees to the left. In the eastern part of the 
United States where land area is not divided by 
the township and range system, other controls of 
land area could be used such as geological. sur- 
vey contour maps (U.S.G.S. quads). 



NORTH CENTRAL 
PROCEDURE 

A township mosaic is assembled from individ- 
ual conventional photos for each township in our 
Region instead of using single individual con- 
ventional photos. Next, township boundaries are 
transferred from maps of plat books to the 
mosaic (thus providing the gound control of the 
area). This compensates for overlap, sidelap, 
crab (apparent sidewise motion of an airplane 
headed into a cross-wind), and tilt (departure 
from the vertical position of the camera). The 
time spent to assemble one township is Vi to 1 
hour. Then a systematic grid of plots is placed 
over the township mosiac. Therefore, sample 
plots are uniformly spaced and an equal number 
of plots is represented in each township. 

In our Region differences in relief are not great 
and contracting companies usually deliver 
photos in the prescribed range, but the scale of 
photos varies. To compensate for this, we have 
many sets of grids with variations of scale from 
1:15,000 to 1:42,000. The appropriate grid is 
selected to match the scale of the photo mosaic. 



RESULTS AND DISCUSSION 



The assembled mosaic system allows us to 
locate plots systematically on the aerial photos, 
which minimizes the bias due to photography in 
photo plot sampling. We still cannot entirely 
eliminate small differences of relief and some dis- 
tortion on the edge of the photos, but we do 
eliminate costs of the map-ground transfer 
procesa and believe that our statistical results 
are realistic, 
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A means of estimating diameter at breast 
height (d.b.h.) from stump measurements has 
several applications: (1) the volume removed in 
cases of timber tresspass can be calculated using 
local volume tables, (2) diameter distributions 
and basal areas for sample plots or stands can be 
calculated if trees have been cut and the data 
were not previously recorded or were lost, and 
(3) removals from a large forest area can be cal- 
culated. The equations presented here were 
developed as part of the 1977 Minnesota forest 
inventory. The d.b.h. estimated by these equa- 
tions was used as an input to other volume equa- 
tions to make an estimate of the timber removed 
from Minnesota's forests, 

Previous studies of d.b.h. and stump diameter 
relations in other regions of the country have 
included few tree species of interest in the north- 
central States or have not been done in a form 
easily adaptable to forest inventory computer 
processing. Several of these studies were pub- 
lished only as tables or charts (Rapraeger 1941, 
Cunningham et al. 1947), while others were 
based on a linear relation from a fixed stump 
height (Bones 1960, 1961). Stump height was 
used as a variable by Curtis and Arney (1977) 
for estimating d.b.h. of second-growth Douglas- 
fir in the Pacific Northwest, McClure (1968) used 
stump height in a model similar to the model 
used in our study but his equations and tables 
cover species found in the southeast. 

METHODS 

We collected data from 2,576 trees. These data 
were collected as part of forest product utiliza- 
tion studies conducted in conjunction with forest 



inventories in Michigan, Wisconsin, and Minne- 
sota. Measurements were taken from random 
samples of felled treea at logging operations in 
these States. Along with the d.b.h. for each tree, 
the diameter outside bark (d.o.b.) was measured 
to the nearest 0.1 inch at half-foot intervals from 
0.5 to 2.6 feet above the ground (table 1). If an 
abnormality, such as a bulge or fork, occured at 
the measurement height, the measurement was 
not taken. 



Table 1. D.b.k, regression coefficients for tree 
species of the Lake States 
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RESULTS 

Because the best estimate of d.b.h. is obtained 
by measuring stump diameter at the highest 
point on a given stump and stump heights vary 
so greatly, stump height was chosen as an inde- 
pendent variable along with d.o.b. Plotting the 
ratios of d.b.h. to d.o.b. suggests a model of the 
form: 



HARDWOODS 



Oaks 



d.b.h. 
d.o.b. 



ln(H)+C- d.o.b. -H 



where, A,B, & C = regression parameters, and 
H - stump height at which d.o.b. was measured. 

Then, we modified the equation as follows to 
make it usable for stump heights ranging from 
to 4.5 feet, because the natural logarithm of zero 
is undefined. 



d.b.h. 
d.o.b. 



-1 + A + B (ln(H + 1.0) = In 5.5) 
+ C d.o.b. (H-4.B) 



where, A = the regression coefficient for a given 
species group, B + C = regression coefficients, 
H = stump height in feet, d.o.b. = stump diam- 
eter outside bark in inches at H, and d.b.h. = 
diameter at breast height in inches. 

This modified regression model was fit using 
multiple linear regression with species groups as 
a dummy variable. The species included in each 
species group are listed below. 



Common name 



Scientific name 



Eastern white pine 
Red pine 
Jack pine 
White spruce 
Black spruce 
Balsam fir 

Tamarack 

Northern white-cedar 

Other softwoods 



SOFTWOODS 



Pinus strobus 
Pinus resinosa 
Pinus banksiana 
Picea glauca 
Picea mariana 
Abies balsamea var. 

balsamea 
Larix laricina 
Thuja occidentalis 
Juniperus virginiana 
all other softwoods 



Quercus alba 
Quercus bicolor 
Quercus macrocarpa 
Quercus rubra 
Quercus ellipsoidalis 
Carya cordiformis 
Gary a ouata 
Betula alleghaniensis 
Acernigrum 
Acer sacckarum 
Acerrubrum var. 

rubrum 

Acer saccfiarinum 
Fraxinus americana 
Fraxinus nigra 
Fraxinus pennsyluanica 
Populus balsamifera 
Betula papyrif era var. 

papyrifera 

Populus grandidentata 
Populus tremuloides 
Tilia americana 
Ulmus americana 
Ulmus rubra 
Ulmus tkomassii 
Juglans cinerea 
Juglans nigra 
Primus serotina 
Acernegundo 
Betula nigra 
Celtis occidentalis 
Populus deltoides 
Noncommercial species 

The R 1 for the regression is 0.64616 and the 
standard error of estimate is 0.5966. The A + 1 
coefficients B and C equal 0.1273 and 0,001641, 
respectively. 

Figure 1 illustrates the advantages of this 
model, which increases the taper in the lower 
section of the stump for large trees. For ex- 
amples, the data for northern white-cedar, bal- 
sam poplar and bigtooth aspen, and ash have 
been put in table form (tables 2, 3, and 4). These 
tables may be used to find the estimated d.b.h.'s 
for these trees when the stump height and stump 
d.o.b. are known. When the stump diameter is in 
other than 1-inch increments, interpolate to esti- 
mate d.b.h. A graph could be used in the field as 
a quick method of estimating d.b.h, (fig.2). 



Red oak 

Northern pin oak 
Hickories 

Yellow birch 
Hard maples 

Soft maples 



Ashes 



Balsam poplar 
Paper birch 

Bigtooth aspen 
Quaking aspen 
American basswood 
Elms 



Select hardwoods 



Other hardwoods 



31 




10 IS 20 

STUMP DOB (INCHES) 



30 



Figure 1. The relation of stump height and 
stump d. o. b, for 10- and 20-inch d. b. h. red pine. 



Species group ) 


Trees ; 


Observations 


fl.b.h. range 


Coefficient 
A 4 1.0 




fhmb 


or 1 


JnoJie-a 




White pine 


34 


13B 


9.4-33.0 


1.07035 


Red pine 


98 


392 


3.1-22.6 


.05462 


Jack pine 


474 


1916 


4.9-19.4 


.04608 


Htitte spruce 


51 


117 


5.0-U.4 


.01082 


Black spruce 


124 


126 


5,0-13.3 


.02679 


Balsam ffr 


139 


532 


4.2-15.6 


.01367 


Tamarack 


7 


14 


7.2-9.2 


1 .01752 


Northern 










wh1te-Jar 


97 


194 


5,2-17.2 


, .95610 


Other softwoods 


__ 


--. 




M .04608 


White oak 


34 


137 


10.6-22,9 


1 .04427 


.Red S northarn 










pin oak 


107 


425 


6. 3-30. & 


1 . 056SS 


Hickory 











*1 . 0565& 


Yellow birch 


22 


47 


13.2-23.8 


1.10481 


Hard maple 


Q9 


239 


6,7-24.2 


1.05199 


Soft maple 


27 


114 


8,0-24.2 


1.05199 


Ash 


31 


218 


7.4-24.7 


1 . 05559 


Balsam poplar) 










Slgtooth aspen 


187 


5B2 


5,0-17.8 


1.07697 


Paper birch 


189 


795 


5'. 0-1 3.6 


1.05155 


tluakfng aspen 


738 


2706 


5.0-20.5 


1.06439 


Basswond 


25 


99 


9.S-26.7 


1,07428 


Elm 


48 


174 


7.0-30.5 


1,06734 


Select hardwoods 


4 


ZD 


8.1-11.7 


1.10929 


Other hardwoods 


_. 







3 1. 06439 


NoncoiTTieircial 










sp&ctes 


-- 


-- 


-- 


3 1, 06439 


TOTAL 2 


,575 


9,2a? 


3.4-33,0 






a llsed Uie value for Jack pine. 
Used t&& value for red oak. 
Used the value for quaking aspen, 



Table 2. Estimated d. b. h, for northern white- 
cedar from stump height and d.o.b. 
(In inches) 



Stump 
d.o.b. 


: Stump height (feet) 


: 0.5 


: l.fl 


: 1.5 


: 2.0 


5 


3.8 


4.0 


4.2 


4.3 


6 


4.5 


4.8 


5.0 


5.1 


7 


5.2 


5.5 


5.7 


6.0 


8 


5.9 


6.3 


6.5 


6.8 


9 


6.6 


7.0 


7,3 


7.6 


10 


7,3 


7.7 


8.1 


8.4 


11 


7.9 


8.4 


8.8 


9.2 


12 


8.5 


9.1 


9.6 


10.0 


13 


9.2 


9.8 


10.3 


10.7 


14 


9,8 


10.5 


11,0 


11.5 


15 


10.4 


11.1 


11.7 


12.3 


16 


11.0 


11.8 


12.4 


13.0 


17 


11.5 


12.4 


13.1 


13.8 


18 


12.1 


13,0 


13.8 


14,5 


19 


12.7 


13,6 


14.5 


15.2 


20 


13.2 


14.2 


15.1 


15.9 


21 


13.7 


14.8 


15,8 


16,6 


22 


14.2 


15.4 


16.4 


17.4 


23 


14,7 


16.0 


17.1 


18.0 


24 


15,2 


16,5 


17.7 


18.7 


25 


15.7 


17.1 


18.3 


19.4 


26 


16.1 


17,6 


18.9 


20.1 


27 


16.6 


18.2 


19.5 


20,7 


28 


17.0 


18,7 


20.1 


21.4 


29 


17.4 


19,2 


20.7 


22.0 


30 


17.8 


19.7 


21,2 


22.7 


31 


18.2 


20.1 


21.8 


23.3 


32 


18.6 


20.6 


22.3 


23.9 


33 


18.9 


21.0 


22.9 


24.5 


34 


19.3 


21.5 


23.4 


25.1 


35 


19.6 


21.9 


23.9 


25,7 


36 


20.0 


22.3 


24.4 


26.3 



Figure 2. D, b, h, for red pine as a function of 
stump d t o.b. 



32 



Table 3. Estimated d. b, h. for bigtooth aspen 

and balsam poplar from stump height and do. 6. 

( In inches) 



Table 4. Estimated d. b. h. for ash from stump 

height and d.o. b. 

(In inches) 



Stump 
d.o.b. 


: Stump height (feet) 


: 0.5 : 


1.0 : 


1.5 : 


2.0 


5 


4.4 


4.6 


4.8 


4.9 


6 


5.2 


5.5 


5.7 


5.9 


7 


6.1 


6.4 


6,6 


6.8 


8 


6.9 


7.2 


7.5 


7.8 


9 


7.7 


8.1 


8.4 


8.7 


10 


8.5 


8.9 


9.3 


9.6 


11 


9.3 


9.8 


10.2 


10.5 


12 


10.0 


10.6 


11.0 


11.4 


13 


10.8 


11.4 


11.9 


12.3 


14 


11.5 


12.2 


12.7 


13.2 


15 


12.2 


13.0 


13.6 


14.1 


16 


12.9 


13.7 


14.4 


15.0 


17 


13.6 


14.5 


15,2 


15.8 


18 


14.3 


15.2 


16.0 


16.7 


19 


15,0 


16.0 


16.8 


17.6 


20 


15.6 


16.7 


17.6 


18.4 


21 


16.3 


17.4 


18.4 


19.2 


22 


16.9 


18.1 


19.1 


20.1 


23 


17.5 


18.8 


19.9 


20.9 


24 


18.1 


19.5 


20.7 


21.7 


25 


18.7 


20,2 


21.4 


22.5 


26 


19.3 


20.8 


22.1 


23.3 


27 


19,9 


21.5 


22.8 


24,1 


28 


20.4 


22.1 


23.5 


24.8 


29 


21.0 


22,7 


24.2 


25.6 


30 


21.5 


23.3 


24.9 


26.4 


31 


22.0 


23.9 


25.6 


27.1 


32 


22.5 


24.5 


26.3 


27.9 


33 


23.0 


25.1 


26.9 


28,6 


34 


23,5 


25,7 


27.6 


29.3 


35 


23.9 


26.2 


28,2 


30.0 


36 


24.4 


26.8 


28.8 


30.7 



Stump : 


Stump he1 


ght l feet) 


d.o.b. : 


0.5 : ' .0 


: ' .5 : 2.0" 


5 


4.3 4.5 


4.7 4.8 


6 


5.1 5.4 


5.6 5.7 


7 


5.9 6.2 


6,4 6,6 


8 


6.7 7.0 


7,3 7.6 


9 


7.5 7.9 


8.2 8.5 


10 


8.2 8.7 


9.1 9.4 


11 


9.0 9.5 


9.9 10.3 


12 


9.7 10.3 


10.8 11.2 


13 


10.5 11.1 


11.6 12,0 


14 


11.2 11.8 


12.4 12.9 


15 


11.9 12.6 


13,2 13,8 


16 


12.6 13.4 


14.0 14,6 


17 


13.2 14.1 


14,8 15,4 


18 


13.9 14.8 


15,6 16,3 


19 


14.5 15.5 


16.4 17,1 


20 


15.2 16.2 


17.1 17,9 


21 


15.8 16.9 


17.9 18,7 


22 


16.4 17.6 


18,6 19,5 


23 


17.0 18,3 


19,4 20,3 


24 


17.6 18.9 


20,1 21,1 


25 


18.2 19.6 


20.8 21,9 


26 


18.7 20.2 


21.5 22.7 


27 


19.3 20.8 


22.2 23.4 


28 


19.8 21.4 


22.9 24.2 


29 


20.3 22.0 


23.6 24.9 


30 


20,8 22.6 


24.2 25.7 


31 


21.3 23.2 


24.9 26.4 


32 


21.8 23.8 


25.5 27.1 


33 


22.2 24.3 


26.2 27,8 


34 


22.7 24,9 


26.8 28.5 


35 


23.1 25.4 


27.4 29.2 


36 


23.5 25.9 


28.0 29,9 
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